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1. INTRODUCTION 

Software products play a crucial role in enhancing productivity within software development. This 
association stems from the imperative for software to effectively address the needs of end users, resulting in 
profound implications not only for the economy [1] but also across various dimensions encompassing 
technical and social considerations [2], [3], emotional influences (such as affecting moods and feelings) [4], 
work-life dynamics [5], [6], culture [7], and collaborative teamwork [8], [9]. Notably, software quality has 
exhibited a progressive upswing over time, primarily attributable to the implementation of enhanced 
requirements management practices [10] and the active engagement of users throughout the development 
process. However, user-involved development necessitates deeper understanding and integration with system 
users' thoughts and emotional experiences [11]. 

In software development, developers employ personas to delineate target user groups and gain 
insights into the characteristics and requirements of system users. Personas are considered an effective 
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approach for depicting user attributes and desired objectives [12]. Prior research has demonstrated that 
persona data collection typically involves qualitative and quantitative techniques to facilitate an in-depth 
exploration of user understanding and creating representative personas [13], [14]. Nevertheless, qualitative 
methods have limitations stemming from their reliance on the cognitive abilities of the development team 
[15]. In contrast, quantitative methods may need more contextual details and depth of qualitative 
approaches [16]. 

Previous studies propose an alternative approach to characterizing users through using personas in 
high-level conceptual design, aligning with user-centered development (UCD) methodologies [17]. Within 
the context of design thinking, personas can be employed to engage users in contemporary development 
approaches [18]. The persona method encompasses various activities, such as collecting, analyzing, and 
synthesizing information about the users interacting with the software under development [17]. 

However, developers need help concerning the proficiency gap between novice and professional 
developers. Additionally, the user persona method becomes problematic when the integration process 
becomes overly intricate and subjectively determined by the developer [19]. Furthermore, the identification 
process must account for the developer's characteristics and capabilities to design and implement usable 
systems [20], [21]. Consequently, personas become an integral aspect of exploring software requirements. 

The persona concept has garnered attention from researchers aiming to offer practical solutions for 
describing user interactions that can be represented through applications, reflecting user goals [17], [20], 
[22]. However, personas have been criticized for being abstract, subjective, and misleading [23]. Several 
personas have been developed to address this issue, including the holistic persona with five dimensions: 
factual, personality, intelligence, knowledge, and cognitive process [24]. To assess the effectiveness of 
holistic personas in determining user needs and supporting conceptual design, a series of empirical studies 
have been designed and conducted. These studies aim to investigate whether including personality traits in 
the interactions between designers and personas leads to variations in conceptual design. The persona design 
in this research derives from the user experience (UX) journey framework, which combines user experience 
and requirements to segment users based on their characteristics. Based on the problems and objectives in the 
previous paragraph, this research is expected to provide a framework that novice developers or designers can 
use to fill the gaps in the balance between developers and personas. In addition, developers can identify their 
own abilities to increase productivity. 

The paper follows a well-organized structure that includes a literature review, research questions, 
methodology, results, discussion, and conclusions. The literature review provides an overview of previous 
studies, while the research questions and methodology outline the study's objectives and the approach taken 
to collect and analyze data. The results section presents the findings, supported by visual aids, and the 
discussion section interprets and analyzes the results in the context of the research questions and relevant 
theories. The paper concludes by summarizing the main findings, restating the research questions, discussing 
the implications and contributions of the study, and suggesting potential future research directions. The 
overall structure ensures a logical progression of information and facilitates a comprehensive understanding 
of the study's outcomes. 


2. METHOD 
2.1. Context 

To gain insight into how persona influences the user requirement, we conducted a set of empirical 
studies aimed at addressing the primary research query: do different personas lead to distinct conceptual 
designs customized for each personality? 

In our research, we conducted empirical studies to investigate the influence of personality on the use 
of personas. Our main research question focused on determining whether personas with different 
personalities lead to distinct conceptual designs tailored to each personality. Participants were presented with 
personas characterized by varying personality traits and asked to propose designs addressing the personas' 
identified personal issues. The collected data was analyzed using a sequential mixed-method approach, 
combining qualitative, and quantitative methodologies. Initially, a qualitative analysis was employed to 
categorize the design concepts, followed by quantitative statistical methods to validate or refute the 
hypothesis. The interpretive aspects of the research were guided by grounded theory, which emphasizes 
immersion in the subject matter and data-driven analysis. Thematic analysis was employed to analyze and 
classify the design artifacts generated by participants, involving iterative examination, documentation, and 
reclassification of the data. The utilization of mixed methods and thematic analysis contributed to the 
generation of a comprehensive report and enhanced the validity and relevance of the findings [25]—[28]. 

The research process (Figure 1) began with a thorough competitor analysis to gain insights into the 
existing market landscape, with detail of each activity as given in Table 1. This analysis helped identify 
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competitors’ strengths, weaknesses, and strategies, providing a foundation for further research. Building on 
this analysis, a hypothesis was formulated to guide the investigation. The next step involved identifying 
relevant behavioral variables that would help capture the key aspects of user behavior and preferences. This 
involved considering factors such as user demographics, attitudes, and needs. Subsequently, in-depth 
interviews were conducted with target users to gather qualitative data and map their experiences and 
perspectives. The information obtained from the interviews was then synthesized to identify common 
characteristics and extract relevant goals or objectives of the target user group. Finally, personas were created 
to represent distinct user profiles, incorporating the synthesized data to encapsulate the target users' key 
attributes, motivations, and goals. These personas served as valuable tools for understanding and empathizing 
with the users throughout the design and development process. 


{ 3 
—> ae —— 
Competitor Hypotesis Ident: i 
5 Behavi 
analysis Variables Relevant Goals 


Figure 1. Research process 


Table 1. Set activities purposed for personas (new activities in italics) 


Activities Sub activities Ref. Goal 
Competitor a. A direct and indirect [29] Gain insights and understanding of competitors’ strengths, 
analysis competitor weaknesses, strategies, and market positioning to inform decision- 
b. Strength, weaknesses, making and develop competitive advantages. 
opportunities, and threats 
(SWOT) analysis 
Hypothesis a. Identify possible personas [29] To propose a testable explanation or prediction that guides research 
b. Hypothesis explanation and investigation into a specific phenomenon or relationship. 
Identify a. List of possible behavioral [29] To identify and analyze the specific actions, attitudes, or patterns of 
behavioral variables behavior that are relevant to the research topic or study. 
variables b. Variables explanation 
c. Variables scale 
Map a. Map interview [29] To systematically document and analyze the information gathered 
interview b. Variables explanation from interviews in order to identify key themes, patterns, and insights 
relevant to the research objectives. 
Synthesize a. Characteristics pattern [29] To consolidate and integrate the collected information to identify 
characteristics b. Relevant goal explanation common attributes and objectives that can guide the design and 
and relevant development process. 
goals 
Persona To develop a fictional representation of a target user, which helps to 


understand user needs, behaviors, and preferences in order to design 
products or services that align with their requirements. 


We carried out three empirical studies encompassing three distinct case studies: a booking system 
for planes or trains, technology aimed at children, and technology designed for the elderly. The participants 
in these studies primarily consisted of novice developers currently pursuing studies in computer science at 
our university, with only a few individuals possessing expertise in software requirements and UI/UX. The 
beginner developers were final-year students who had successfully completed the required competency in 
requirements engineering as per the association for computing machinery (ACM) curriculum [30]. The total 
number of participants was 130, although some opted not to work on specific case studies in certain 
instances. The research was conducted on our university campus between late December 2022 and mid- 
February 2023. Each participant voluntarily engaged in all stages of the case studies over a two-week period 
for each individual case study. The participants were provided with general instructions outlining the tasks to 
be performed. For instance, in the case of the technology designed for the elderly, the participants were 
instructed: "you are tasked with providing a design solution (a new application) for a problem faced by the 
elderly. The expected outcome of this project is a useful and suitable solution that can compete with existing 
competitors". Through this instruction, the researchers aimed to encourage participants to explore their 
creativity. All participants worked on their downloaded UX journey worksheets and were requested to 
document their activities, including conducting interviews with potential users, throughout the process [31]. 
After the assignments, each participant was asked to assess the quality of the requirements gathered using the 
requirements matrix as a measure of productivity [31]. However, it should be noted that this measurement did 
not part of the research process, as the data was intended for productivity measurement purposes. 
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2.2. Modelling process as Markov chain 

Numerous studies have been conducted on modeling and analyzing business processes [32], [33]. 
While existing process models in the literature primarily focus on representing the overall software process, 
they are unsuitable for quantitatively determining the similarity between processes or capturing individual 
task steps [32], [33]. The researchers employed Markov chains, commonly used in various domains, to model 
and analyze stochastic processes [34]. Markov chains facilitate the representation of stochastic processes by 
utilizing system states and transitions. In the context of a programmer's task process, each task corresponds to 
a distinct Markov chain, where states represent different conditions, and transition probabilities indicate the 
likelihood of transitioning between states during task execution. The researchers incorporated transition 
matrices to calculate the probability of transitioning from one state to another, enabling quantitative analysis 
and comparison of task processes. 

In the proposed methodology, tasks are essential activities that need to be performed. These tasks 
are part of the UX journey method, as depicted in Figure 1. Developers execute each task to obtain user 
stories and personas. Each task is modeled using the Markov chain model to better understand the system's 
behavior over time. In this model, each state represents a specific condition within the process, and the 
transition probability indicates the likelihood of transitioning from one state to another. This probability 
quantifies the frequency of state transitions relative to the total number of state transfers. A notable aspect of 
employing the Markov chain is its application to the task process. Specifically, the Markov chain captures 
changes in the status or state of the ongoing task process. Each step within the task process corresponds to a 
state in the Markov chain, and the transition probability reflects the likelihood of moving from one step to 
another during the execution of the process. By utilizing the Markov chain model for the task process, it 
becomes possible to analyze and predict the trajectory of the process and comprehend how the preceding step 
influences the subsequent step. For instance, when a developer performs the "competitor analysis" activity, 
there is a probability of subsequently undertaking the "hypothesis" activity or the "identify behavioral 
variables" activity. The probability of transitioning from one activity to another is determined by the ratio of 
the number of times the developer performs the transition from the first activity to the second activity within 
the task process to the total number of times the developer advances from the third step and beyond. 


2.3. Analysis difference between case study 

By utilizing the Markov task process model, the disparities between the two task processes can be 
elucidated by evaluating the dissimilarity between their respective Markov chains. The Markov chain 
represents alterations in the state or condition of an ongoing task process. Within this framework, each 
individual step or activity within the task process is regarded as a distinct state within the Markov chain. At 
the same time, the transition probability denotes the likelihood of transitioning from one step to another 
during the execution of the process. The extent of divergence between the two Markov chains can be 
ascertained by scrutinizing the dissimilarity between the two task processes. This dissimilarity is gauged by 
comparing the transition probabilities and states manifesting at each Markov chain step. A more pronounced 
distinction in transition probabilities and states between the two Markov chains indicates a greater disparity 
between the task processes. Consequently, the Markov task process model is invaluable for quantitatively 
evaluating and measuring the variations between two task processes. The disparity between the two Markov 
chains is calculated using the state transition matrix and determining the sum of absolute differences by 
calculating the Manhattan distance. 


3. RESULTS AND DISCUSSION 

The outcomes of the comprehensive persona, including a user requirements matrix intended for 
productivity testing, are publicly accessible in the Indonesian language [35]. The discussion section 
encompasses multiple subsections, including illustrations of the persona creation process, a statistical 
analysis of the three case studies, an examination of activity similarities, and an assessment of their impact on 
productivity. 


3.1. Persona creation process 

In cooper's study, later updated by Silvia, persona creation was identified as an open and structured 
activity embedded within a sequence of tasks that involve specific objectives, techniques, and deliverables. 
However, previous publicly available empirical research suggests that developers need help in formulating 
hypotheses within the context of creative thinking. As part of hypothesis generation, participants were 
instructed to identify competitors using two assessment methods: SWOT analysis and competitor analysis. 
The provided worksheet includes a set of questions for SWOT analysis to assist participants in identifying 
each component. The competitor analysis section consists of three components: direct competitors, indirect 
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competitors, and feedback from users or developers. Tables 2 and 3 present illustrative examples of 
competitor analysis using case studies of train ticket booking systems. 


Table 2. Example of SWOT analysis 


SWOT Questions Example answer 
Strength Advantages? The train booking system offers a user-friendly interface and seamless integration with multiple 
payment options for convenient transactions. 

Uniqueness? The train booking system stands out with its exclusive partnerships with major railway 
companies, providing access to exclusive train routes and discounts. 

Selling points? The train booking system's selling points include competitive pricing, comprehensive customer 
support, and additional services like seat selection and travel insurance options. 

Skills? The train booking system demonstrates its expertise through skilled customer service 
representatives, experienced software developers, and proficiency in data analytics. 

Other factors? The train booking system stands out with its eco-friendly initiatives, accessibility features, and 
integration with other transportation modes for seamless travel options. 

Weaknesses Limitations? The train booking system has limitations in terms of the limited availability of regional train 
services, which may result in fewer options for certain destinations. 

Lack of effort? The train booking system lacks sufficient effort in providing real-time updates on train delays or 
cancellations, leading to potential inconvenience for users. 

Problems? The train booking system experiences occasional technical glitches, such as slow loading times or 
errors during the booking process, impacting user experience. 

Poor strategy? The train booking system lacks a clear marketing strategy to effectively target and attract a 
diverse range of customers, potentially limiting its user base. 

Other factors? The train booking system faces challenges in adapting to changing customer preferences and 
demands, such as the integration of emerging technologies or personalized travel 
recommendations. 

Opportunities Improvements? Providing a more convenient and accessible booking experience for users. 

Performance? By optimizing the system's performance and reducing response times, the train booking system 
can offer a faster and more efficient booking process, improving overall customer satisfaction. 

Opportunities? Explore partnerships with travel agencies or other transportation services to offer bundled travel 
packages, expanding its range of services and attracting a wider customer base. 

Consumer Promote its environmentally friendly features, such as reduced carbon emissions, to attract 

behavior? environmentally conscious travelers. 

Other factors? Leveraging emerging technologies, such as artificial intelligence and machine learning, the train 
booking system can provide personalized recommendations and tailored travel experiences, 
increasing customer engagement and loyalty. 

Threats External Strikes or infrastructure issues pose a threat to the reliability and availability of the train booking 
trouble? system. 

Obstacles? Increased competition from alternative modes of transportation. 

Trends? Leading to decreased usage and revenue because of the growing popularity of online travel 
agencies and aggregator. 

Other factors? Changes in fare structures or restrictions can pose challenges and uncertainties for the train 
booking system's operations and profitability. 

Table 3. Example of competitor analysis 

Direct competitor Indirect competitor 
kai.id Skyscanner 
traveloka.com Booking.com 
TrainBooker Expedia 
Trainline Kayak 
Omio Rome2rio 
GoEuro 


User reviews 


Direct competitor app - '"TrainBooker": 

Positive review: "TrainBooker is incredibly user-friendly and offers many train options. The booking process is seamless, and I love 
how I can easily compare prices and schedules. Highly recommended!" 

Negative review: "I had some issues with the TrainBooker app. The search results were not always accurate, and I encountered 
errors when completing my booking. Customer support was slow to respond, which was frustrating". 

Indirect competitor app - "Skyscanner": 

Positive review: "Skyscanner is my go-to app for travel bookings, and their train ticket feature is a great addition. Having all my 
travel options in one place is convenient, and the app always delivers reliable results". 

Negative review: "I found the train ticket booking feature on Skyscanner to be a bit lacking. The search results were limited, and 
sometimes the prices displayed were outdated. It would be great to see more improvement in this area". 


SWOT and competitor analysis, depicted in Tables 2 and 3, offers a comprehensive assessment of 
the existing market landscape. Consistent with prior research, the creative thinking process begins with broad 
considerations and potential divergent thinking, generating various design solutions that are refined into a 
primary solution through convergent thinking. 
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Identifying potential users and exploring possible challenges from the findings of the previous stage 
formed the foundation for constructing hypotheses, as presented in Table 4. This represents an advancement 
from previous research conducted by Silvia, where hypotheses were derived solely from developer 
assumptions without a solid basis for their formulation [29]. This limitation poses a significant hurdle for 
novice developers, particularly when they need to gain prior experience in developing user-centric design 
solutions. In contrast, our study incorporates an analysis of comparable or similar design solutions, providing 
developers with a more informed basis. Nevertheless, consistent with earlier research, hypotheses are 
employed to uncover variables that may differentiate users by identifying distinctive needs and behaviors. 


Table 4. Example of hypothesis 


Hipotesis Persona Explanation 
Ho Commuter To find the most convenient and cost-effective train routes for daily commuting to work. 
H1 Leisure traveler To explore new destinations and plan memorable train journey with friends or family. 
H2 Business traveler To efficiently manage business travel arrangements, including train bookings, for himself and his team. 


In the study conducted by Silvia, the term "identified behavioral variables" was introduced as a 
novel concept [29], contrasting with the terminology used in cooper's research [17]. However, we find it 
more suitable to refer to these variables as "observed behavioral variables" since they will serve as the focus 
of our observations. While Silva emphasizes the newness of this activity, cooper's research recommends 
creating a comprehensive list of behavioral variables derived from various aspects of observed behavior. We 
acknowledge Silva's observation that obtaining information through interviews can challenge developers. 
However, our research has yet to provide a detailed explanation of the process of forming this list. Hence, our 
study formulated the list as a compilation of observed insights derived from the SWOT analysis, particularly 
in the opportunity section, as presented in Table 5. 


Table 5. Example of observed behavioral variables 


Persona Variable Scale 
Commuter a. Frequency of train bookings for business purposes. Strongly 
b. Time spent on managing travel arrangements. disagree- 
c. The number of team members included in the bookings. strongly agree 
d. Utilization of corporate account management features. 
e. Engagement with customer support for issue resolution. 
Leisure a. | The number of train bookings made for leisure travel. Strongly 
traveler b. Variety of destinations chosen. disagree- 
c. Interaction with personalized recommendations and itinerary customization options. strongly agree 
d. Engagement with social features for sharing travel experiences. 
e. Utilization of group booking features for traveling with friends or family. 
Business a. Frequency of train bookings for commuting purposes. Strongly 
traveler b. Time spent searching for suitable train routes. disagree- 
c. Utilization of search filters to find convenient schedules. strongly agree 
d. Engagement with features providing cost comparisons and discounts. 
e. Feedback or complaints related to the availability and reliability of train services for commuting. 


Prior to conducting the map interview activity with participants, the developer will transform the 
statements related to the observed behavioral variables into user stories format, following the structure "as 
(user) I want/need (something) so I can do (something)” [36]. This format enhances the clarity and purpose 
of the interview process. According to Silvia's research, the precision of the interview itself is of little 
importance, as the main objective is to explore the relationship between different respondents based on the 
predefined persona derived from the hypothesis. The outcomes of the interviews are represented using 
smileys, with labels obtained from defining the observed variables. The developer's worksheet displays these 
labels in a five-scale format, but respondents can indicate a grayscale that falls between the two 
predetermined scales. An example of a map interview representation is illustrated in Figure 2(a). 
Subsequently, significant variables are identified from the map interview activity. The aim is to identify and 
group respondents who exhibit similar patterns and values based on the proximity of their indicated scales. A 
group typically comprises six to twelve respondents choosing the same or closely related variables. However, 
in our research, we limit the closeness to three to six respondents to gather more comprehensive information, 
without compromising the accuracy of the design solution pursued. A representation of this activity is 
presented in Figure 2(b). The established groups serve as the foundation for constructing a persona. 
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SYNTHESIZE CHARACTERISTICS AND 
RELEVANT GOALS 


MAP INTERVIEW 
Persone: Commuter Osaa 


© © © © e 


Seale t Scale 2 Scale & Scale 4: Seale $ 
Strongly Agree Agree Neutral Disagree Strongly Disagree 


USER FEELING 
Persona: Commuter 


Variable 1 


Variable 2 


Variable 1: As a frequent business commuter, [want the train booking system to offer a seamless 
and efficient process for making frequent bookings, allowing me to easily manage my travel 


arrangements, 


Variable 2: As a busy commmniter, [wank the train booking system to save me time by providing 
quick and intuitive navigation, reducing the time spent on managing travel arrangements and 
making bookings. 


eo o o 


Variable 5 


Variable 3:As a team leader or manager, | want the train booking system to support bookings for 
muitiple team members allowing me to easily Include and manage bookings for my team 


members in a single transaction 


Variable 4: As a corporate traveler, | want the train booking system to provide features for 
managing corporate accounts, such as storing payment information and preferences, fxcilitating 
easier and faster bookings for business purposes 


© © ®© 


Variable 5: As a commuter, | want the train booking system to have reliable customer support 
available, allowing me to quickly resolve any issues or concerns that may arise during the booking 
process or while traveling. ensuring a smooth commuting experience 


© © ® 
(a) (b) 


Variable .. : 


Variable .. : 


Variable . : 


Figure 2. Example of interview: (a) map interview and (b) synthesize characteristics and relevant goals 


An in-depth understanding of the respondents' backgrounds and preferences plays a crucial role in 
guiding developers in defining the objectives of their development efforts. The persona creation process 
serves as a means to represent potential system users. A key aspect of persona development is prioritizing 
target users, ensuring that the developer focuses on specific user groups sample of the persona from the 
method as given in Figure 3. This is achieved through the synthesis of relevant characteristics and goals that 
help developers gain a deeper understanding of their intended user segments. To facilitate this process, this 
study proposes a comprehensive set of persona components, as outlined in Table 6, that enable developers to 
precisely identify the target audience. 


Persona name: liham Roles & tasks: Commuter 


Age: 18 Profession: Student 
User quote: Personality traits/ behaviour: 
ma fan oro tan bea 


boring, but I am happy with the comfortable and spacious 
‘seat options on the train 


Extrovert conscientious 


Lifestyle: 


Motivation: 


Ordinary man 


For some, train travel is a way to explore new placos and 


p, train 
have adventures. 


Skill & Knowledge: 


N] intenet M] smartphones 


Segment: 


As a frequent business commuter, | want the train booking system 
to offer a seamless and efficient process for making frequent 


[_] website M] social media 


bookings, allowing me to manage my travel arrangements easily 


Others 


Sketch persona: 


Figure 3. Example of persona 
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Table 6. Proposed persona structure 


Component 


Content 


Example 


Persona name 


Age 


User quote 


Lifestyle 


Skill and 
knowledge 


Role and tasks 


Profession 


Personality 


traits/behaviour 


Motivation 


Segment 


Sketch persona 


Target users with specific characteristics, goals, and 
pain points, enabling designers and developers to 
create solutions tailored to their needs and 
preferences 

Age information helps create a more realistic and 
relatable persona that aligns with the target 
audience's demographics, needs, and behaviors. 


User quotes provide direct insights and perspectives 
from real users, capturing their thoughts, opinions, 
and experiences. 


Represents the daily routines, preferences, and 
behaviors of the target user 


Represents the user's proficiency and familiarity 
with various digital tools and platforms. 


Represents the user's specific responsibilities, roles, 
and tasks in relation to the product or service being 
analyzed. 

Represents the user's occupation, field of work, or 
professional background 


Represents the characteristic patterns of thoughts, 
feelings, and behaviors that influence how a user 
interacts with a product or service 


Represents the underlying needs, desires, and goals 
that drive a user's behavior and decision-making 
process. 


Represents the specific target audience or user 
group with similar characteristics, needs, or 
preferences. 


The visual representation of the persona and 
includes key information and characteristics 


Ilham: identified Ilham's strength in efficiently 
managing travel arrangements and her expertise 
in finding cost-effective train routes. 


18 years: consider any strengths identified in 
the SWOT analysis that are related to age, such 
as a persona's familiarity with technology or 
specific preferences that align with a particular 
age group. 

Waiting at the train station before the trip can 
be a bit boring, but i am happy with the 
comfortable and spacious seat options on the 
train: identify strengths of the product or 
service that can be highlighted in user quotes to 
showcase the positive aspects experienced by 
users. 

Ordinary man: if the product offers convenience 
and time-saving features, it can appeal to users 
with a busy or fast-paced lifestyle. 

If the product offers a user-friendly interface or 
integrates well with popular social media 
platforms, it can appeal to users who are 
already proficient in using such tools. 
Commuter: identify the strengths of the product 
or service that align with specific user roles or 
tasks 

Student: if the product offers advanced data 
analysis capabilities, it may be well-suited for 
professionals in data-driven fields such as 
marketing or finance. 

Extrovet conscientious: if the product offers a 
user-friendly interface and intuitive navigation, 
it may appeal to users who prefer simplicity and 
ease of use. 

If the product offers time-saving features or 
cost-effective solutions, it may appeal to users 
motivated by efficiency or financial 
considerations. 

As a frequent business commuter, i want the 
train booking system to offer a seamless and 
efficient process for making frequent bookings, 
allowing me to easily manage my travel 
arrangements. 


For novice developers, a clear understanding of the target audience is paramount to ensure the 
development efforts align with user expectations. Unlike previous research conducted by Silvia, which 
categorized respondents into primary and secondary groups, this study treats all respondents equally and 
considers them valuable development references. Furthermore, the map interview conducted during the 
research process allows developers to obtain valuable insights and elicit system requirements directly from 
the respondents. The outcomes of the identification process outlined in Table 6 are subsequently transformed 
into personas, which are documented in the UX journey worksheet. It is important to note that personas are 
fictional representations that may not necessarily correspond to an individual respondent but serve as 
archetypes that embody the characteristics and needs of specific user groups. 


3.2. Statistical analysis 

Statistical analysis is a crucial component of research that involves applying various statistical 
techniques to analyze and interpret data. It plays a significant role in gaining insights, making inferences, and 
drawing conclusions from research findings. 

In project 1 (Table 7), the SWOT analysis revealed a minimum score of 1, a maximum score of 180, 
an average score of 16, and a standard deviation of 19. This suggests a moderate level of evaluation across 
the strengths, weaknesses, opportunities, and threats. The competitor analysis indicated a range of 3 to 200, 
with an average of 20 and a standard deviation of 19, indicating a relatively high level of competition. The 
hypothesis and identified behavioral variable analysis showed a minimum score of 2, a maximum score of 
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60, an average score of 11, and a standard deviation of 8.1, indicating a reasonable understanding of user 
needs and behaviors. The map interview analysis had a range of 2 to 100, an average score of 12, and a 
standard deviation of 13, suggesting moderate user engagement. The characteristics and relevant goals 
analysis showed a range of 2 to 90, an average score of 19, and a standard deviation of 16, indicating a good 
understanding of user characteristics and goals. The persona analysis had a minimum score of 1, a maximum 
score of 75, an average score of 11, and a standard deviation of 11, suggesting room for improvement in 
persona development. 


Table 7. Time to complete (in minutes) 


Project 1 Project 2 Project 3 
Activity Min Max Avg STD Min Max Avg Std Min Max Avg STD 
Swot 1 180 16 19 1 180 16 19 1 180 16 19 
Competitor 3 200 20 19 1.2 200 21 21 3 200 20 19 
Hypothesis 2 60 11 81 1 80 13 12 2 60 11 81 
Identify behavioral variable 1 60 12 88 1 90 14 13 1 60 12 88 
Map interview 2 100 12 13 14 120 15 19 2 100 12 13 
Characteristics and relevant goals 2 90 19 16 2 120 19 18 2 90 19 16 
Persona 1 75 11 11 1 60 14 11 1 75 11 11 


In project 2 (Table 7), the SWOT analysis had a minimum score of 1, a maximum score of 180, an 
average score of 16, and a standard deviation of 19, indicating a similar evaluation as in project 1. The 
competitor analysis showed a range of 1 to 200, an average score of 21, and a standard deviation of 21, 
suggesting a slightly higher competition level than project 1. The hypothesis and identified behavioral 
variable analysis had a minimum score of 1, a maximum score of 80, an average score of 13, and a standard 
deviation of 12, indicating a reasonable understanding of user needs and behaviors. The map interview 
analysis had a range of | to 4, an average score of 120, and a standard deviation of 15, indicating a moderate 
level of engagement with users. The characteristics and relevant goals analysis showed a range of 2 to 120, 
an average score of 19, and a standard deviation of 18, suggesting a good understanding of user 
characteristics and goals. The persona analysis had a minimum score of 1, a maximum score of 60, an 
average score of 14, and a standard deviation of 11, indicating room for improvement in persona 
development. 

In project 3 (Table 7), the SWOT analysis had a minimum score of 1, a maximum score of 180, an 
average score of 16, and a standard deviation of 19, similar to the previous projects. The competitor analysis 
indicated a range of 3 to 200, an average score of 20, and a standard deviation of 19, showing a similar level 
of competition as in project 1. The hypothesis and identified behavioral variable analysis had a minimum 
score of 2, a maximum score of 60, an average score of 11, and a standard deviation of 8.1, indicating a 
similar understanding of user needs and behaviors as in project 1. The map interview analysis had a range of 
2 to 100, an average score of 12, and a standard deviation of 13, suggesting a similar level of user 
engagement as in project 1. The characteristics and relevant goals analysis showed a range of 2 to 90, an 
average score of 19, and a standard deviation of 16, indicating a similar understanding of user characteristics 
and goals as in project 1. The persona analysis had a minimum score of 1, a maximum score of 75, an 
average score of 11, and a standard deviation of 11, suggesting a need for improvement in persona 
development, similar to project 1. 


3.3. Analyzing the similarity between activity 

Statistical analysis is a crucial component of research that involves applying various statistical 
techniques to analyze and interpret data. It plays a significant role in gaining insights, making inferences, and 
drawing conclusions from research findings. 


3.3.1. Modelling as Markov chains 

The Markov chain process is a mathematical analysis process that models a system's transition from 
one state to another. It is a stochastic process that satisfies the Markov property, which states that the future 
state depends only on the current state and is independent of the past states. In a Markov chain, a system can 
exist in a finite or countably infinite number of states. Transition probabilities govern the transition from one 
state to another. These probabilities represent the likelihood of moving from one state to another in a single 
step. The mathematical analysis of a Markov chain involves studying the transition probabilities and using 
them to calculate various system properties. Some common analyses include determining the steady-state 
probabilities (also known as the stationary distribution), representing the system's long-term behavior, 
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calculating expected values and probabilities of reaching specific states, and predicting future events based 
on the current state (1). 


(h) (h) (h) 

Pà Po Pin 

h) — (h) — | p™® (h) (h) 
PW = |2; NxN Poa Py» Pyn () 

(h) (h) (h) 

Paa Pia Fan 


This matrix is also stochastic because each row represents the conditional distribution. Where P is 
the transitional matrix, and |P; ilnen'® the transitional matrix with NxN probability in discrete time Markov 


chain with the conditions {X,,,n > 0} in the state space h = {1, 2, ..., N} based on space that predict. From 


this condition P;; = i,j < Nand for the first space, 2i =1 Pi = 1,1 < i < N. Por the 1|-step, 2-step, n-step 


transition, as (2)-(4): 


p® = p©) x P (2) 
PĒ = P® x P = P® xP x P = PO x P? (3) 
P® = p@-1) x pa... = pO x pr (4) 


The transition probabilities in the provided Markov chain describe the likelihood of moving between 
different process stages. The chain starts with the SWOT stage, where the transition to the competitor stage is 
probability 1, meaning it always occurs. From the competitor stage, the process always moves to the 
hypothesis stage with a probability of 1. A sample task process is given in Figure 4, describing each step's 
flow and probability. 


Matrix of 
[next_state] = | transition | [curent_state| (5) 
probabilities 

X,=X)xP (6) 
Ay 1 167/0.158 0.158 + 0.198 0.356 
By 1 20}} 0.198 0.198 0.198 
Cy 1 11]}0.108 0.108 0.108 
Di) = 0.33 12||0.118 |= 0.33 x 0.108 =| 0.0356 
Ey 0.33 0.5 12||0.118 0.33 x 0.108 + 0.5 x 0.118 0.0946 
Fi 0.33 05 1 19||0.188ļ| |0.33 x 0.108 + 0.5 x 0.118 + 1 x 0.188| | 0.282 
Gy 1 44114140.108 0.108 0.108 


0,33 Q 
Figure 4. A sample task process of average productivity modeled as a Markov chain 


Within the hypothesis stage are three possible transitions, each with a probability of 0.33. This 
indicates an equal chance of moving to identify behavioral variable, map interview, or characteristics and 
relevant goals. Once in the identify behavioral variable stage, there is an equal probability of 0.5 to transition 
either to the map interview stage or the characteristics and relevant goals stage. The transition from map 
interview to characteristics and relevant goals is certain, with a probability of 1. Markov chain represents a 
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sequential process where the initial stages (SWOT, competitor) always lead to subsequent stages (hypothesis, 
identify behavioral variable), and from there, the process branches out with equal probabilities to different 
stages (map interview, characteristics and relevant goals) before eventually reaching the final stage persona. 


3.3.2. Difference between high-average activity 

Average productivity refers to the typical or usual level of productivity observed within a given 
context. It represents the average or mean level of output or efficiency achieved by the entities being 
considered. Hence, high productivity refers to a level of productivity that exceeds the average or typical 
performance. It represents a higher-than-normal output or efficiency level achieved by individuals. High 
productivity implies that the entities are performing exceptionally well and achieving output levels 
surpassing the average or standard. 

In project 1 (Figure 5), the minimum value observed is 22.5, and the maximum value is 490, 
indicating the range of the data. The first quartile (Q1) is 58, representing the 25th percentile and the third 
quartile (Q3) is 116, representing the 75th percentile. The median, calculated as 80, represents the midpoint 
of the data. The mean for project 1 is approximately 100.08, indicating the overall average. The interquartile 
range (IQR), measuring the spread of the middle 50% of the data, is 58. The lower limit is -29 and the upper 
limit is 203, defining the expected range for most of the data. 
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Figure 5. Difference high-average activity 


In project 2 (Figure 5), the minimum and maximum values are 20 and 490, respectively, showing 
the range of the data. The first quartile is 58, the median is 88, and the third quartile is 140. The mean for 
project 2 is approximately 112.51, slightly higher than that of project 1. The IQR is calculated to be 82, 
indicating a wider data spread. The lower limit is -65, and the upper limit is 263. Project 3 (Figure 5) 
statistical measures are identical to those of projects 1 and 2, with the same minimum, maximum, quartiles, 
median, mean, IQR, lower limit, and upper limit. Considering minimum and maximum values provides 
insights into the potential presence of outliers or extreme data points. The range of values from 22.5 to 490 
indicates the variability within each project's data. 


3.5. Discussion 

Software development plays a critical role in improving productivity within the realm of software 
engineering. As the introduction highlights, software products have far-reaching implications across various 
dimensions, including technical, social, emotional aspects, work-life dynamics, and collaborative teamwork. 
To ensure that software effectively meets the needs of end users, developers employ personas to understand 
user characteristics and requirements. The persona activity involves creating representations of target user 
groups, enabling developers to gain valuable insights into user preferences, behaviors, and goals. Compared 
to previous research [29], this research provides a comprehensive solution for representing user targets using 
characteristic and relevant goals. This understanding helps guide the design and development process, 
ensuring that software meets user needs and expectations. However, user persona development has 
challenges. Balancing qualitative and quantitative techniques can be complex, and the subjective 
determination of personas can pose difficulties, particularly when integrating them into the software 
development process. 

This is where the application of a Markov chain model becomes relevant. By modeling developer 
productivity and predicting the probabilities of the next states, the Markov chain offers a powerful tool for 
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optimizing software development processes. The model considers historical or real-time data, estimating 
transition probabilities based on past observations. This predictive capability enables project managers and 
team leads to plan resources effectively, allocate tasks efficiently, and identify potential areas for 
improvement. 

Integrating the Markov chain model with the persona activity provides a comprehensive approach to 
software development. By incorporating user characteristics, behaviors, and goals derived from personas, the 
model can predict the next state of developer productivity more accurately. This correlation between persona 
insights and the Markov chain's predictive power allows a more nuanced understanding of how user 
requirements and preferences impact developer productivity. However, it is important to note that the 
accuracy of the Markov chain model relies heavily on the quality and relevance of the data used to estimate 
transition probabilities. Additionally, the model assumes that future developer behavior depends solely on 
their current state and is independent of their history. These assumptions, while simplifying the modeling 
process, may only sometimes capture the complexity of real-world scenarios. Integrating the Markov chain 
model with the persona activity offers a powerful approach to optimizing software development processes. 
By leveraging persona insights and predicting probabilities of the next states, organizations can enhance 
resource allocation, identify areas for improvement, and ultimately deliver software that better meets the 
needs and expectations of end users. Further research and refinement of the model and the continued use of 
personas can contribute to more accurate predictions and improved software development outcomes. 


4. CONCLUSION 

Integrating the Markov chain model in the software development process offers valuable insights 
into predicting the next state of developer productivity. By following a structured sequence of activities, such 
as conducting a SWOT analysis, analyzing competitors, formulating hypotheses, identifying behavioral 
variables, mapping interviews, and creating personas, organizations can gather relevant data to feed into the 
Markov chain model. This predictive approach enables better resource allocation and decision-making, 
ultimately improving project planning and successful software development outcomes. By harnessing the 
power of the Markov chain and leveraging the information obtained from the persona-building process in 
projects 1, 2, and 3, organizations can enhance their ability to forecast developer productivity and optimize 
their software development endeavors. Integrating quantitative analysis and qualitative understanding 
provides a valuable framework for improving efficiency and productivity in software development projects. 
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